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Abstract. We present a short overview on the strongest variational formulation for gradi- 
ent flows of geodesically A-convex functionals in metric spaces, with applications to diffusion 
equations in Wasserstcin spaces of probability measures. These notes are based on a series of 



lectures given by the second author for the Summer School "Optimal transportation: Theory 
f*l and applications" in Grenoble during the week of June 22-26, 2009 
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Introduction 

These notes are based on a series of lectures given by the second author for the Summer School 
"Optimal transportation: Theory and applications" in Grenoble during the week of June 22-26, 
2009. 

' We try to summarize some of the main results concerning gradient flows of geodesically A- 

convcx functionals in metric spaces and applications to diffusion PDE's in the Wasserstein space 
of probability measures. Due to obvious space constraints, the theory and the references presented 
here are largely incomplete and should be intended as an over-simplified presentation of a quickly 
evolving subject. We refer to the books [3l[68] for a detailed account of the large literature available 
on these topics. 

[~ — ' In the first section we collect some elementary and well known results concerning gradient flows 

C*~) . of smooth convex functions in R d . We selected just a few topics, which are well suited for a 

"metric" formulation and provide a useful guide for the more abstract developments. 

In the second section we present the main (and strongest) notion of gradient flow in metric spaces 
ON ' characterized by the solution of a metric evolution variational inequality: the aim here is to show 

the consequence of this definition, without any assumptions on the space and on the functional 
(except completeness and lower semicontinuity) : we shall see that solutions to evolution variational 
inequalities enjoy nice stability, asymptotic, and rcgularization properties. We also investigate the 
relationships with two different approaches, curves of maximal slope and minimizing movements, 
and we discuss a first stability result with respect to perturbations of the generating functional 
' with respect to T-convcrgence. 

The third section is devoted to some fundamental generation results for gradient flows of geodesi- 
cally A-convex functionals: here we adopt the method of minimizing movement to construct suit- 
able families of discrete approximating solutions and we show three basic convergence results. 

Apart from Sections 12.61 (stability of gradient flows with respect to T-convergence of the func- 
tionals) and 13.11 (existence of curves of maximal slope), we made a substantial effort to avoid any 
compactness argument in the theory, which is mainly focused on purely metric arguments. So 
we will present a slightly relaxed version of the minimizing movement scheme, which is always 
solvable by invoking Ekcland's variational principle, and the main existence and generation results 
for A-gradient flows rely on refined Cauchy estimates and crucial geometric assumptions on the 
distance of the metric space. 

The last section is devoted to applications of the metric theory to evolution equations in the 
so called "Wasserstein spaces" 2?i(lC) of probability measures. We recall a few basic facts about 
such spaces, the characterization of geodesies and absolutely continuous curves, and some geo- 
metric properties of the Wasserstein distance. Three basic examples of (or, better, displacement-) 
A-convex functionals in ^(B^) are presented, together with the evolutionary PDE's they are 
associated with. A short account of possible extensions of the theory to measure-metric spaces 
concludes the notes. 
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1. Gradient flows for smooth A-convex functions in the Euclidean space 

In this section we recall some simple properties of the gradient flow of a C 2 function ip : R d — > M 
satisfying the global lower bound D 2 (^ > XI for some A 6 R. We will focus on those aspects which 
rely just on the "metric" structure of R d and therefore could make sense in more general metric 
spaces. We denote by d(u, v) = \u — v\ the Euclidean distance on R d induced by the scalar product 
(v>- 

Remark 1.1 (A few basic facts about A-convex functions). We will extensively use the following 
well known equivalent characterizations of a A-convex function <p : W l — > R (here x,xq,xi are 
arbitrary points in R d ) 

Hessian inequality: 

B 2 (p(x) > XI i.e. (D 2 tp(x)£,Q > A|£| 2 for every £ G R d . (1.1a) 
A-monotonicity of Vip: 

(V^(i ) - V<p(xi),x - xi) > X\x - xi\ 2 . (1.1b) 
A-convexity inequality: 

tp(x ) < {l-6)ip{x ) + 6ip(x x )~6(l- 6)\x - x x \ 2 x e := (1 — 0)xq + Oxi, 6 G [0,1]- (1.1c) 
Sub-gradient inequality: 

(V<p(xx),xi - x ) - ^\xi - x \ 2 > <p(xi) - <p(x ) > (Vip(x ),xi - xq) + ^\xi - x \ 2 - (l.ld) 
Notice that 

p> is A-convex if and only if (p{x) := f(x) — ~^\ x \ 2 is convex. (1-1°) 
In particular, there exist constants a G R, b G M. d such that 

<p(x) >a+(b,x) + ^\x\ 2 . (l.lf) 

Definition 1.2 (Gradient flow). The gradient flow of ip is the family of maps 

S t :R d ^R d , iG[0,+oo), 

characterized by the following property: for every uq G R d , So(uo) := uq and the curve Ut ■= St(uo), 
t G (0, +oo), is the unique C 1 solution of the Cauchy problem 

—Ut = —Vip(ut) in(0, +oo), \imut = uq. (1-2) 
at tio 

By the standard Cauchy-Lipschitz theory and the a priori estimates we will show in the next 
theorem, for every initial datum uq G R d equation (|1.2|) admits a unique global solution so that 
the family St, t G [0,+oo), is a continuous semigroup of Lipschitz maps, thus satisfying 

S t+h (u ) = S t (S h (u )), limS t (-u ) = S (u ) = u Q for every uq G R d . (1.3) 
1.1. Basic estimates. 

Theorem 1.3 (Basic differential estimates). Let us assume that ip G C 2 (M. d ) is X-convex; if 
u : [0, +oo) — > R d is a solution of (jl.2[) then 

d 1 , r0 . A, l9 _ w d 



dt 2 



k - tf + ^|«f - <f - e ~ A *^( eAt ^ M * ^ v \ 2 ) ^ - f° r ever v v e Rd > ( evi a) 
^K) = -KI 2 —|v^ t )l 2 <o, (ei) 
A( c 2 ^|v^ t )| 2 ) = A( c2 A tK |2)< . (SIa) 
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moreover, if v is another solution to (|1.2|) then 

A(e At K-t- t |) < 0. (Cont A ) 

Proof. We sketch here the easy calculations. 
For the evolution variational inequality QEVIaI ): 

d 1 l2 . . . OJ / \ \ / \ , s A. l2 

d^'"*""' = \u t ,u t -v) = {Vip(u t ),v -u t ) < (p{v) - if(u t ) - -\u t - v\ . 



The energy identity (|EI|) : 



The s/ope inequality ( |SIa D ■ 

^|V^( Ut )| 2 = 2(DV(«t)V^(ut),ui> ® -2(DV(«t)V^(w t ), V^(«t)) -2A|V^ t )| 2 - 
The X-contraction property ( |Coiit A [ ): 

d IU~2t ll.lt>> 

— \u t - v t \ 2 = 2{u' t - v' t ,u t - u t ) = -2(V(^(u t ) - Vtp(v t ),u t - v t ) < -2A|u t - u t | 2 . □ 
In order to write in a simple way suitable integrated versions of the previous inequalities, we 



set 



Corollary 1.4 (Pointwise and integral inequalities). If u : [0, +oo) — s- R d is a solution to (|1.2[) 
then 

, 2 



C 



A/ 



EaW) ' , , |2 i 



— |«t - u| a + E A (t)(p(ut) - p(t>)) + V ^ ^ |Vy( Mf )| 2 < -K - «| a for every v g M d , (1.5) 

^{ut) + \J (|<| 2 + |V^K)| 2 )dr = ^( Uo ), (1.6) 

|V V (ttt)| < e- At |V^(«o)|; (1-7) 

moreover, if v is another solution to (|1.2[) £/ien 

|«t-«t| <c- At |u -wo|- (1.8) 
In particular, when A > 0, ip admits a unique minimum point u and 

-\u t - u\ 2 < <p(ut) - <p(u) < ^|V^(u t )| 2 (1.9) 

\ut-u\ < c- xt \u ~u\, ip(u t )-if{u)<c- 2Xt ((p(u )~^{u)). (1.10) 

Proof. We have just to check (| 1 . 5[) : if A t denotes the quantity in the left-hand side, we show that 
At is nonincreasing. A differentiation in time yields 

-A t = e At (^| Mt - v\ 2 + ~\ut- v\ 2 + ¥>(«*) - <p{v) + E A (t)|V^(u t )| 2 ) 

+ E A (t)A^ )+ (!^l)!^|v^K)i 2 ) 

V E A (i)(c A '|V^)| 2 + ±<f(u t ) + ^^IV^OI 2 ) 
® E A (i)((c A < - 1)|V^K)| 2 + ^^|V^ t )| 2 ' 



¥E A (i)((c At -l-AE A (t))|V^K)| 2 ) ^0. □ 
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In terms of the maps S t , (|1.8[) yields the A-contraction estimate 

d(S t (uo),S t («b)) < c- M d(u ,v ) for every u ,v G M. d , t > 0, (1.11) 

thus showing the Lipschitz property of St and the uniqueness and continuous dependence w.r.t. 
the initial data of the solutions of (11.21). 



1.2. Approximation by the Implicit Euler scheme. One of the simplest but very useful 
ways to construct discrete approximations of the solution to fj 1 . 2|) (and to show its existence by a 
limiting process) is given by the implicit Euler scheme. 

For a given time step r > we consider the associated uniform partition of [0, +oo) 

TV := {0 = t° T <tl< ... < i™ < ...}, t n T := nr, (1.12) 

and we look for a discrete sequence (C/") raS N whose value U™ should provide an effective approxi- 
mation of u(£"). t/™ arc defined recursively, starting from a suitable choice of fZj? ~ uq, by solving 
at each step the equation of the Euler scheme 

jjn _ jjn-l 

T T = -Vyffl?) n = l,2 ) ---, (1.13) 

T 

or, equivalently, 

U? = J r (Wr 1 ), J t -={I + tV V )-\ (1.14) 

Existence of a discrete approximating solution can be easily obtained by looking for the minimizers 
of the function 

U i ^ $(r, l/?" 1 ; [/) : = -L|[/ - [Z^ 1 1 2 + tp(U). (1.15) 
2r 

In fact, it is immediate to check that any minimizer U™ of (|1.15[) solves ()1 . 13|) : moreover, the 
function defined by (|1.15[) is (t^ 1 + A)-convex and therefore it admits a unique minimizer whenever 
r- 1 > -A. 

Denoting by U T : [0, +oo) — » M. d the piecewise linear interpolant of the discrete values (£/") n gN 
on the grid 3V, defined by 

t — t n ~ l t n — t 

U T (t) := ^U?- 1 + -E (7 T n if t g ( L1 6) 

r t 

one expects that C/ T (t) converges to the solution u t to (|1.2j) as r | 0. 

Theorem 1.5. 7/lim T ^oC/|? = uo £/ien the family of piecewise linear interpolants {U t ) t> q satisfies 
the Cauchy condition as r \. with respect to the uniform convergence on each compact interval 
[0, T], T > 0; its unique limit is the solution u t of (|1.2p . Moreover, for every T > there exists a 
universal constant C(A, T) smc/j £/ia£ 

sup \u t -U r (t)\<\u -U?\+C(\,T)\V<p(u )\T. (1.17) 
te[o,T] 

In particular, when A = we can choose C = , independent of T . 

Remarks about the proof. In the present finite dimensional smooth setting, the proof of the con- 
vergence of U T is not difficult: considering e.g. the case A = 0, we can apply the contraction 
property of the map J T defined by (| 1 . 14[) 

\J T (x) - J T (y)\ < \x - y\ for every x,yeW l , (1.18) 

to obtain the uniform bound 

t-^U^-U?-^ = \V<p(U?)\ < IV^IT™- 1 )! for every n > 1, (1.19) 

so that 

\U' T {t)\ Ksu-pr-^U? -U?-^ =t- 1 \U 1 t -U*\ < \V<p(U°)\ for every te [0,+oo)\V T . (1.20) 
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Since lim T ^o |Vy(t/^)| = |Vy(uo)| it follows that (U T ) T> o satisfies a uniform Lipschitz condition 
and therefore it admits a suitable subsequence uniformly converging to a Lipschitz curve u in each 
compact interval [0,T]. Denoting by U T (t) the piecewise constant interpolant 

U T (t):=U r T l if t £ {^-\t% (1.21) 

the same estimate (|1 .20[) shows that 

sup \U T (t)-U T (t)\<r\Vip(U°)\, (1.22) 
te(o,+oo) 

so that U T has the same limit points than U T . On the other hand, (|1.13|) yields 

U' T {t) = -V V >{U T {t)) in [0,+oo)\D> r> (1.23) 

and we can pass to the limit in an integrated form of (|1.23|) thus showing that u solves p. 21) . 

The uniform error estimate (|1.17l) is subtler: a simple derivation in the case A = can be found 
in [5T], see also [551 [SI]. Its main functional interest relies on the fact that it involves just the 
lower bound on the Hessian of ip but not its upper bound (and therefore, it does not require a 
uniform Lipschitz condition on V</>). □ 

1.3. Metric characterization of Gradient flows in M. d . The energy identity (|EI[) (with his in- 
tegrated version p.6p ) and the evolution variational inequality ( |EVI>,D not only provide important 
estimates on the solution to (|1.2[) but can also be used to characterize it. 

Concerning (|EI[) we can even relax the identity, as the following proposition shows. 

Proposition 1.6 (Curves of maximal slope). A C 1 curve u : [0, +00) — > M. d is a solution to (|1.2[) 
if and only if it satisfies the Energy Dissipation Inequality 

^V(ut) < -5KI 2 - ^|V^K)| 2 zn(0,+co) (EDI) 
or its weaker integrated form 

<P( u t) + \f (l<| 2 + |V<^K)| 2 )dr <tp(u ) for every ie(0,+oo). (EDI') 

Proof. II u is a C 1 curve the chain rule yields 

<p(u t ) = tp(u ) + f (V<p(u r ),u' r )dr, (1.24) 



so that (|EDF|) yields 

' |< + V(^K)| 2 dr = 1 j (K| 2 + |V(^K)| 2 )dr+ ( (Vvj(«t)X> dr < 0, 



n 



and therefore u' r = —\7ip(u r ) for «5f 1 -a.e. r £ (0, i). Since t is arbitrary and u £ C 1 , u solves 
(O). □ 



Notice that in the previous formulation we did not use the A-convcxity of ip: the argument only 
relies on the chain rule. 

In the following proposition we show that also the evolution variational inequality QEVIaD 
characterizes a solution of (|1.2[) . In fact, if ( |EVI>,D admits a solution for every initial datum uo, 
then ip is A-convex. 

Proposition 1.7 (Characterization of Gradient Flows through the EVI). If u : [0, +00) —> M. d is 
a C 1 curve solving ( |EVIa[ ) then u is a solution to (|1.2p . 

Proof. Applying the chain rule for the squared distance function i| • — v\ 2 we easily have 

(u' t ,u t - v) < (p(v) - ip{ut) - ^\u t - v\ 2 for every v £ R d , t > 0. (1-25) 
Choosing v := Ut + e^, for e > and ^ £ M. d and dividing by e we obtain 

-K,0 <e _1 (v(«*+eO-v(«t)) - ylCI 2 for every e £ M d . 
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Passing to the limit as e I we eventually get 

-K,£) < (V V («t),0 for every £ g M d , 
so that — itj = Vy(ut). □ 

Proposition 1.8. Lef us suppose that there exists a C 1 semigroup St : K d — >• K d , i > 0, of smooth 
maps such that for every uq g IR d tte curve Ut '■= St(uo) satisfies flEVI.\| ). TTien 95 is X-convex. 

Proof. We consider for simplicity the case A = 0; for arbitrary it , u 1 g M. d we set 

u s := (1- s)u° + SU 1 , u s t -~S t (u s ) 

and we want to show that 

c^)| s =o * £<rf«*)U- 

We get 

d .^„.sv /v7..^.on ..1 ..o\ EU /d__ 0i _, . d/1. 



t=0 



JevO JevO H / 1 \ OJ H 

< ^u 1 )- ¥>(«") < --(-\ u °-ul\ 2 ) lt=0 < (V<p(u 1 ),u 1 -u°) = -<p(u s )\ s=i n 

1.4. Extensions to more general functional settings. The simple finite dimensional theory 
for smooth functionals has been extended in various directions; without claiming any completeness, 
we quote here four different points of view: 

The theory of differential inclusions and maximal monotone operators in Hilbert spaces, developed 
in the seventies by Komura [39], Crandall-Pazy [25], Crandall-Liggett [24], Brezis [13], 
Benilan [10], J.L. Lions [40]: we refer to the monographs [III [8] [40]. In this framework one 
considers the gradient flow generated by a proper lower semicontinuous A-convex functional <p '■ 
H — > (—00, +00], where H is a separable Hilbert space. By using tools of convex analysis, clever 
rcgularization techniques, and replacing Vip with the multivalued subdiffercntial operator d<f>, one 
can basically reproduce all the estimates and results we briefly discussed in the finite dimensional 
setting which just depend on the lower bound of the Hessian of tp, avoiding any strong compactness 
assumptions. 

In this framework, the resolvent operator J T := (I + rd(j))~ 1 is single-valued and non-expansive, 

i.e. 

6(J T [u],J T [v\)<6(u,v) for every u, v g if, r > 0. (1-26) 

This property is the key ingredient to prove, as in the Crandall-Liggett generation theorem 
|24) , uniform convergence of the exponential formula 

u t = lira (J t/ „rM, dK (J t/n )>oD < 2m ^ o)t (1.27) 

n-s-+oo ,/n 



and therefore to define a contraction semigroup on D(cf>). 

Being generated by a convex functional, this semigroup exhibits a nice regularization effect, since 
Ut g D(d4>) even if u$ g D{4>). Moreover, the curve it* can be characterized as the unique solution 
of the evolution variational inequality ( |EVIa| ), whose formulation goes back to [41] . Optimal 
error estimates for the implicit Euler discretization in the spirit of (|1.17|) have been obtained by 

ESI El E]. 

The theory of the curves of maximal slope in metric spaces, developed in the eighties by De 
GiORGi, Degiovanni, Marino, Tosques in a series of papers originating from [501 [H], and 
culminating in [3JJ [31] (but see also the more recent [TB] and the presentation of [U [3])- Here 
<f> : X — > (—00, +00] is a proper and lower semicontinuous functional defined in the complete 
metric space X and one looks for absolutely continuous curves satisfying a suitable form of the 
Energy dissipation inequality (|EDI[) . where \u'\ should be interpreted as the metric velocity of the 
curve u and |Vy(u)| should be replaced by the metric slope of </>. The theory is usually based on 
local compactness of the sublevels of <f> and various kind of assumptions on its slope, yielding in 
particular its lower semicontinuity and the possibility to write a weak form of the chain rule. The 
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advantage of this approach relies on its flexibility, but in general metric spaces uniqueness and 
stability properties of curves of maximal slope are not known. 

Limits of discrete solutions, generalized minimizing movements. This is the weakest approach, 
which has been clarified in |28j and independently applied to different kind of problems (see 
e.g. [43], [35], [36], [48]). It just provides a general approximating scheme which is quite useful 
to construct some limit curves by compactness arguments, but one can hardly deduce refined 
properties of these curves from general metric results and each example deserves a careful ad hoc 
investigation. 

Evolution variational inequalities in metric spaces: this is the strongest point of view, which is 
related to the metric evolution variational inequality dEVI^D and goes back to Benilan [TU] 
notion of integral solutions to evolution equations in Banach spaces. Its application to gradient 
flows in metric spaces has been developed in [3] and it will be adopted in these notes. 

2. Gradient flows and evolution variational inequalities in metric spaces 

The aim of this section is to study the metric notion of gradient flows associated to the (metric 
formulation of the) evolution variational inequality flEVI,\| ). 

Throughout the rest of these notes, [X, d) will be a complete and separable metric space and 
(j) : X — > (— oo, +oo] a proper and l.s.c. functional on X with non empty domain D(<p) = {v G X : 
(f>(v) < +oo}. We will look for curves u : [0, +oo) — > X which satisfy properties that depend only 
on the metric structure of X and that in the case of a smooth function 4> = f ° n X = R d satisfy 
the ODE (TOIl. 



2.1. A few metric concepts. Let us first recall the notion of metric velocity and metric slope 
(see e.g. [5]). 

Definition 2.1 (Absolutely continuous curves). We say that a curve v : (a, b) C R — > X belongs 
to ACS, (a, b; X) for some p£ [1, +oo] if there exists m G ^Yi oc ) ( a i such that 

6(v a , vt) < / m(r) dr for every a < s < t < b. (2-1) 



If p = 1 we say that v is a (locally) absolutely continuous curve. 

Theorem 2.2 (Metric derivative). If v : (a,b) — > X is an absolutely continuous curve then the 
limit 

KK^lim^l ( 2 .2) 
s->t \t — s\ 

exists for -a.e. t G (a, b) and it is called metric derivative of v at the point t. Moreover, the 
function t i— ^ belongs to L l (a,b), it is an admissible integrand for the right hand side of 

(|2.ip . and it is minimal in the following sense: 

\v'\(t) < m(t) for J^ 1 -a.e. t € (a, 6), for each function m satisfying (|2.ip . 

Definition 2.3 (Metric Slope). The metric slope of (p at a point v G X is given by 

oo ifv^D{4>), 
,0 if v G D(6) is isolated, fn oN 

(0(w)-0H) ,, 

limsup^ r — - — otherwise. 

w -> v 6{v,w) 

2.2. Structural properties of solutions to Evolution Variational Inequalities. The next 
(quite restrictive) definition is modeled on the case of A-convex functionals in Euclidean-like spaces 
and has been introduced and discussed in [3] Chap. 4]. 

Definition 2.4 (EVI and Gradient flow). A solution of the evolution variational inequality 
EVIa(A, d, (j)), A G R, is a locally absolutely continuous curve u : t G (0, +oo) <— > u t G D((f>) 
such that 

i-^d 2 (w t ,w) + ^d 2 (u t ,v) < <j){v) - (f>(ut) ^-a.c. in (0,+oo), for every v G D(<j>). (EVI A ) 
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A X-gradient flow of (p is a family of continuous maps S t '■ D(<fi) — > D (</>), t > 0, such that for 
every u £ D(4>) 

limSt(w) = u =: Sq(w), St+/j(w) = S/,(St(ti)) for every t, h > 0, (2.4a) 

i/ie curve 1 1— > S t (u) is a solution of F,VI\(X 1 d, 0). (2.4b) 

The next result shows that QEVIaD can be formulated avoiding differentiation and without 
assuming the absolute continuity of u (see [60] for the proof). 

Theorem 2.5 (Derivative free characterization of solutions to QEVIaI ))- A curve u : (0, +oo) — ^ 
.D(</>) is a solution of KVl\(X,d, (f>) according to Definition \2.4\ if and only if for every s,t £ 
(0, +oo) wif/i s < t and v £ D(<f>) 

^--ld 2 (u tl v)-^d 2 (u s ,v) < E A (t- (EVI' A ) 



Notice that ( EVI A I yields the pointwise right- upper differential inequality 
1 d^ A 

-— d 2 (u t ,v) + -d 2 (u t ,v) < <f>{v) - 4>{u t ) for every v £ D{4>), (2.5) 

at every time t > 0: here ^ £ denotes the right-upper Dini derivative limsup^ h~ 1 (( t (t+h) — ( t (t)). 

The next result collects many useful properties of solutions to EVI,\(X, d, 4>) (see [60] and an 
analogous result of [4] in the Wasserstein framework) : they reproduce in the metric framework the 
estimates of the previous section and show that QEVIaD contains all the information concerning 
the gradient flow of <j>. 

Theorem 2.6 (Properties of solutions to QEVIaI )). Let u,?/ 1 ,-!/ 2 : [0,+oo) — >• X be solutions of 
EVI A (X,d,0). 

A-contraction and uniqueness: 

d(u],u 2 ) < c- x{t - s) d{ul,u 2 ) for every < s < t < +oo. (2.6) 



In particular, for every uo £ D(<f>) there is at most one solution u o/ EVI A (^, d, </>) satis- 
fying the initial condition lim^o Ut = Uq . 
Regularizing effects: u is locally Lipschitz continuous in (0, +00) and u t £ D(\d<f>\) C 
D((j}) for every t > 0. Moreover in the time interval [0, +00) 

the map 1 t— > <f>(u t ) is non-increasing and (locally semi-, if X < 0) convex, (2-7) 

the map is non-increasing and right continuous, (2-8) 

the following regularization/a priori estimate holds 

C ^d 2 {u u v) + E A (t)(^(«t) - <f>(v)) + ^^m 2 ^) < id 2 K, v) (2.9) 
for every v £ D (</>); in particular 

0K)<^) + ^yd 2 (u o , V ), (2.10) 

|^| 2 ( Ut ) < ^-J—^-l^l 2 ^) + __l_d 2 ( Wo ,^) */-At<log2. (2.11) 

Asymptotic expansion for 1 10: If uq £ D(\d<p\) and X < then for every v £ D((f>) and 
t > 

—d 2 (u u v) - -d 2 (u ,v) < E 2X (t)(<t>(v) - + 2"|^| 2 K). (2.12) 
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Right and left limits, energy identity: For every t > the right limits 

d(u t ,u t+h ) d 4>{u t+h ) - 4>{ut) , 

\u t+ \:= hm , — <p(u t+ ) := hm r (2-13) 

Ho h dt hio h 

exist, they satisfy 

±cl>(u t+ ) = -\u t+ \ 2 = -\d<j>\ 2 (u t ), (2.14) 

and they define a right continuous map. (|2.13j) and (|2. 14[) hold at t = iff uq £ D(\d(j>\). 
Moreover, there exists an at most countable set 6 C (0, +oo) such that the analogous 
identities for the left limits hold for every t G (0, +oo) \ C. 
Asymptotic behavior: If X > 0, then 4> admits a unique minimum point u and for every 
t > to > we have 

^d 2 (u u u) < 0(ut) - #u) < ^|^| 2 (ut), (2.15a) 
d 2 (w f ,?2) < d 2 ( Mto ,u)e- A(t -* o) , (2.15b) 
<t>(u t ) - < (</>K) - 0(S))c- 2A ( t - t °), 0( Mt ) - 0(«) < ie^-^J^K.u), (2.15c) 

< |S0|K)e- A ^), |90|( Mt ) < * -. (2.15d) 

e-H 1 ti ^d(M to ,u) 

If X = and u is any minimum point of <f> then we have 

I-,,, n d 2 (u ,u) . . d 2 (u ,u) 

M(«t)< — — , *(««)-*(«)<— (216) 
i/ie map i 1— > d 2 (ut,u) is not increasing. 

Continuity of the energy and the slope: If u n G C°([0, +oo); X) are solutions of 
T£V\\{X, d, (f>) such that lim n ^ +oc Uq — uq, then 

lim 4>(u") = <p(ut) for every t > 0, (2-17) 

n^+oo 

Wuy \d<j)\{u^) = \d<j)\{u t ) for every t G (0, +oo) \ 6. (2.18) 

We jitsi sketch the proof of the contraction property (|2.6|) . For a fixed s € (0, +oo) we have that 
^ld 2 (uliL 2 s ) + ^d 2 (ul,ul) < tiul) - tiul) for every te(0,+oo), (2.19) 
while for a fixed < G (0, +oo) 

^- l -d 2 (u\,u 2 s ) + ^d 2 (ul,u 2 ) < <j>{u]) - <j>{u 2 8 ) for every a G (0,+oo). (2.20) 
OS 2 2 

Adding ([2~E)j) and ([2~2"0T) we get 

~4 2 {u\,u 2 s ) + ~d 2 (ul,u 2 s ) + Xd 2 (ul,u 2 s ) < 0; 
Applying 0| Lemma 4.3.4] we obtain 

^d 2 K\ u 2 ) < -2\d 2 (ul,u 2 ) JSP^a-e. in (0, +oo) 
and therefore we obtain (12.61). □ 



Theorem 12.61 concerns each single solution to QEVIa | ; when the A-gradient flow St of </> exists 
we have further interesting properties, showing that the formulation by QEVIaD is really stronger 
than all the other metric approaches. 



in 
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2.3. A-Gradient flows and A-convexity along geodesies. Let us first recall the notion of 
(minimal, constant speed) geodesies in a metric space X and the related convexity. 

Definition 2.7 (Constant speed geodesies). A curve 7 : [0, 1] — > X is a constant speed geodesic 
(or simply geodesic ) if 

d(7„7 t ) = |t-s|d( 7o ,7i) for every < s < t < 1. (2.21) 

A set D C X is geodesically convex if every couple of points xq,xi £ D can be connected by a 
geodesic 7 contained in D. 

Definition 2.8 (A-convexity along curves and geodesically A-convex functional). We say that 
<j) : X — > (—00, +00] is A-convex along a curve 7 : [0, 1] — > X if 

His) < (l-s)0(7 O ) + S 0(7 1 )-^ S (l-.s)d 2 (7 O ,7 1 ) for every s£ [0,1]. (2.22) 

We say that <f> is geodesically A-convex if every couple of points xq,xi £ D((j}) can be connected by 
a geodesic 7 along which <fi is X-convex. If (f> is geodesically convex and it is X-convex along every 
geodesic connecting Xq,X\ £ D(4>) in D((f>) then we say that (f> is strongly geodesically X-convex. 

Theorem 2.9 f[27|). If the X- gradient flow St of <j) exists then <j) is X-convex along any geodesic 
in D((f>). In particular, if D((j)) is geodesically convex, then </> is strongly geodesically X-convex. 



Proof. Let 7 : s £ [0 , 1] H> 7 s £ D{<f>) be a geodesic with 7 , 7 1 £ D(cj)) and let us set 7 t s := S t (7 s ). 
Applying (EVI' A I we have for every s £ [0, 1] and t > 

ie At d 2 ( 7t s ,7°) ~ ^d 2 (7 s ,7°) < E A (0(0(7°) - </>(lD), (2-23) 

ie At d 2 (7 t s ,7 1 )-^d 2 ( 7 s ,7 1 )<EAW(0(7 1 )-0(7 t s )). (2.24) 
Multiplying (|2.23j) by (1 — s) and (|2.24[) by s and adding the two inequalities we get 

^((l-.s)d 2 (7 t s ,7 ) + S d 2 ( 7t s ,7 1 ))-^((l- S )d 2 (7 s ,7 ) + 5 cl 2 (7 s ,7 1 )) 

< E A (t)((l - S )0( 7 °) + ■s0(7 1 ) - 0(7 t s ))- (2-25) 
We now observe that the elementary inequality 

(1 - s)a 2 + sb 2 > s(l - s){a + b) 2 for every a, b £ R, s€[0,l], (2.26) 
and the triangular inequality yield 

(l- S )d 2 ( 7t s ,7 ) + sd 2 ( 7t s ,7 1 ) > S (l- S )(d( 7t s ,7°) + d(7 t s ,7 1 )) 

> S (1- S )d 2 ( 7 °,7 1 ). (2.27) 

On the other hand, since 7 is a geodesic we have 

(1 - s)d 2 ( 7 s , 7 ) + sd 2 ( 7 s , 7 1 ) = s(l - s)d 2 ( 7 °, 7 1 )- (2.28) 

Inserting ([2T2T]) and (gUj) in (|2~2"5|) we get 

^As(l - S )d 2 ( 7 °,7 1 ) < E A (t)((l - S )0( 7 °) + ^(T 1 ) - 0(7t S ))- (2-29) 
Dividing then both sides of (|2.29l) by Ex (t) and passing to the limit as 1 1 we obtain 

^(7 S ) < (1 - s)^(7°) + s^(7 1 ) - - s)d 2 ( 7 °,7 1 ) for every * e [0, 1]. 



□ 
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2.4. A-gradient flows and curves of maximal slope. 

Definition 2.10 (Curves of maximal slope). We say that a curve u <G AC 2 oc (Q, +oo; X) is a curve 
of maximal slope for the functional <fi if the energy dissipation inequality 

X - J \u'\ 2 {r) dr + X - J \d<f>\ 2 (u r ) dr < - <f>(u t ) (2.30) 

holds for all < s <t < +oo . 

The notion of curve of maximal slope has been first introduced (in a slightly different form) by 
Dc Giorgi and provides a weak notion of gradient flow for nonsmooth functionals, also nonconvex. 
If cf> admits a A-gradicnt flow according to Definition 12.41 then these two definitions coincide. 

Theorem 2.11. Let us assume that the X-gradient flow S of <fi exists and let u € AC 2 oc (0, +oo; X) 
be satisfying (|2 . 30[) with lim t ^oitt ~ uq G D{4>). Then u t = St(ito) for every t > and (|2 . 30[) is in 
fact an identity for every < s < t < +oo . 

2.5. A-gradient flows and the minimizing movements variational scheme. A general vari- 
ational method to approximate gradient flows (and often to prove their existence) is provided by 
the so-called minimizing movements variational scheme. In his original formulation (see e.g. |28j). 
the method consists in finding a discrete approximation U T of the continuous gradient flow u by 
solving a recursive variational scheme, which is the natural generalization of (| 1 . 1 5|) to a metric- 
space setting. If t > denotes the step size of the uniform partition 3V (|1.12[) . starting from a 
suitable approximation U® of uq one looks at each step ((n — 1)t, tit] for the minimizcrs of the 
functional 

U i y $(r, U?- 1 ; U) := -U 2 (C/, + <f>(U). (2.31) 

It 

U T thus takes a value Z7™ G argmin$(r, [7 ri_1 ; ■) on each interval ((n — l)r, nr]. 

Definition 2.12 (The minimizing movement variational scheme). Let us consider a time step 
r > and a discrete initial datum U® £ D(4>). A T-discrete minimizing movement starting from 
U® is any sequence (£/™) n gN * n D(tfi) which satisfies 

$(r, U?- 1 ; U?) < $(t 3 U?- 1 ; V) for every V G X, neN. (2.32) 

A discrete solution U T is any piecewise constant interpolant of a T-discrete minimizing movement 
on the grid V T defined by 

U T {Q) = U a T , U T {t) = U? ift€(t?-\t%n>l. (2.33) 

The existence of a minimizing sequence {[/™} ra gN is usually obtained by invoking the direct 
method of the Calculus of Variations, thus requiring that the functional (|2.31j) has compact sub- 
levels with respect to some Hausdorff topology a on X (sec e.g. the setting of [3J § 2.1]). In the 
next section we will discuss another possibility, still considered in [3], when the functional (|2.31l) 
satisfies a strong convexity assumption. 

In a general setting it is also possible to avoid these restrictions by applying the Ekeland's 
Variational Principle to the functional (|2.31j) : this approach only requires the completeness of the 
metric space. 

Definition 2.13 (A relaxed minimizing movement variational scheme). Let us consider a time 
step t > 0, a relaxation parameter rj > 0, and a discrete initial datum G D((f>). A (t,7])- 
discrete minimizing movement starting from J/fL is any sequence ({/"^neN inD(<j)) which satisfies 

$(r, U?- 1 ; U? >v ) < $(r, U^ 1 ; V) + \ d(C/™ I) , U^ 1 ) d(V, U? t „) for every V G D(0), (2.34a) 
and the further condition 

$(r, U?- 1 ; V? iV ) = ^d 2 (^„, U?- 1 ) + < ^(C/;- 1 ), (2.34b) 

for every n G N. A (t,t))- discrete solution U Tri is any piecewise constant interpolant of a (t, r/)- 
discrete minimizing movement on the grid T T , as in (|2.33[) . 
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Notice that when 77 = a solution to (|2.34a[) is a minimizcr of (|2.31l) (and in particular satisfies 
(I2.34b[l ), so that the usual discrete solutions arising from the minimizing movement scheme are 
included in this more general relaxed framework. The next result |60j , which follows directly from 
Ekeland's variational principle, shows that the previous scheme is always solvable when 77 > 0. 

Theorem 2.14. Let us assume that X is complete and 4> is quadratically bounded from below, i.e. 
for some k , </3 G G M, o G X 

•H 2 -) + yd 2 ^: °) — $0 for every i£l (2.35) 

Then for every rj > 0, t > with r _1 > — kq, and C/f _ G D(<f>), the relaxed minimizing movement 
scheme admits at least a (r, 77) -discrete solution (C7™ ?? ) n gN. 

Since under the general assumptions of Theorem 12 . 141 the relaxed minimizing movement scheme 
admits a (r, r^-discrete solution U T v for fixed rj > and arbitrarily small step size t, it is natural 
to ask what its limit as t 4. 0. A first result in this direction is provided by the next theorem, which 
shows that the minimizing movement scheme is consistent with the definition of A-gradient flow 
12.41 Notice that in Theorem l2.15l we will assume a priori that the A-gradient flow of (j> exists to get 
the convergence of U Tr) ; in Section [3] we will discuss how to remove this strong assumption. Still 
it is sometimes useful to know that any A-gradient flow, no matter how it has been constructed, 
admits a uniformly converging discrete approximation, which exhibits nice variational properties. 

Theorem 2.15. Let us assume that there exists the X- gradient flow St of (j> according to Definition 
\2.J\ and that D((f)) is geodesically convex. Let t > 0, 77 > satisfy rj — A < and let the 
sequence (U" v ) n eN C D(<f>) be a (r, rj) -discrete minimizing movement with U® „ G D(\d(f>\). Setting 
a = a T . v := log(l + 2(A — t])t) we have the a priori error estimate 

d(S t (u ),U T ^t)) < d(u ,U° tn ) + c- aT VT7\d0\(U°, ri ) for every t G [0,T]. (2.36) 

Ln particular if for some r/ > and every r G (0,To) U Tf . is a family of (r, 77)- discrete solutions 
with „ = uq G D(\d<j)\), then lim T ^o U T „ (t) = St(uo) uniformly on every compact interval. 

Let us remark that 77 has been kept fixed in the previous convergence result, so that the 
coefficients a T . v = 57 log(l + 2(A — 77)7") in the estimate (|2.36j) arc uniformly bounded from below 
as r I 0. 

2.6. Stability of A-gradient flows under T- convergence. We conclude this section by showing 
a simple stability property of Gradient Flows with respect to perturbations of the generating 
functional <f>. Here we consider a coercive family of r-convcrging functionals (f> h : X — > (—00, +00], 
/i£N = NU {+00}, which are quadratically bounded from below, uniformly w.r.t. h: for some 
o G X, 4> , k G R they satisfy 

4> h (x) + yd 2 (.T,o) > 4> for every x e X, hen. (2.37) 

In the next definition we jointly recall the (sequential) notions of T-convergence and of coercivity 
|261 Def. 1.12]. For notational convenience, we will identify monotone subsequences (/i n )neN with 
their unbounded image H = {h n : n G N} C N; expressions like lim^g^f, liminfftgjj have an 
obvious meaning as limits for h f +00, h G H. 

Definition 2.16 (Sequential T(X, d)-convergence of coercive functionals). We say that (<p h )h^N is 
a coercive family of functionals T(X , A) -sequentially converging to a proper functional <p°° : X — > 
(—00, +00] if the following two conditions are satisfied: 

(1) For every infinite subset H C N and every bounded sequence (x h )h^H with sup /l6 ^ <fi h (x h ) < 
+00, there exists an infinite subsequence H' C H such that Ymv^H' % h = x°° G D(4>°°) 
and 

liminf 6 h (x h ) > S^ix 00 ). (2.38) 
hew 
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(2) For every x°° £ D(<fi°°) there exists a sequence (x h )h^n such that 

lim d{x h ,x°°) = 0, lim <j) h {x h ) = ^{x 00 ). (2.39) 

It is possible to prove that a coercive family of A-convex functionals (<f) )h&i T(X, d)-converging 
to 4>°° always satisfies the uniform lower bound (|2.37p . 
Let us now state our first convergence result |60j . 

Theorem 2.17. Let {4> h )heH be a coercive family of junctionals T(X,d)- converging to 0°° and let 
us assume that the X-gradient flows S h exist for every h £ N. Then the functional (f>°° admits a 
X-gradient flow S°° and for every sequence Uq £ D((f> h ) converging to ug° £ D(<j)°°) we have 

lim S£(uft) = Sf(«§°), lim h (S£(u£)) = (f>°°(S^(u^)) for every t > 0, (2.40) 

h'f+oc h^+oo 

locally uniformly on (0,+oo). 

Proof. Here we consider the simpler case when (j2.37[) holds for k q = 0; it is not restrictive to 
assume A < and <j) > 0. 

Step 1: uniform bounds. We set := S^(uq) and we fix a compact time interval [0,T], T > 0, 
a point o°° <E D{4> 00 ) and a corresponding sequence o h as in (|2.39|) . (EVI' A ) yields 

d 2 {u h t ,o h ) < (d 2 (u \o h ) +2E A (t)0' 1 (o' 1 ))e- At ; (2.41) 
and therefore there exists a constant C\ (T) independent of h such that 

d(u%,o h ) < Ci(T) for every i € [0,T], h £ N. (2.42) 
The regularizing estimate (|2.9[) yields 
e At fE^lV^ 2 1 

— d 2 K,o' 1 ) + E x (t)^(u*) + l^L|^>f K fe ) < ^'("O' ) + E A (t)*V) < C 2 (T) (2.43) 

if t £ (0,T], for a suitable constant C2(T) independent of ft.. 

In particular for every < 5 < T there exists a constant C(5, T) such that 

fe (^)<C(5,T), = < C(S,T) for every t £ [S,T]. (2.44) 

5fep 2: compactness. By the estimates of the previous point, the sequence (u )he^ is uniformly 
Lipschitz in each bounded interval [S,T] of (0, +oo) and for every fixed t {w^j-heN satisfies the 
assumptions of Definition ^. 16[ so that (u^)h^N is relatively compact in X. Applying Ascoli-Arzcla 
theorem we can find a subsequence H = (ft. T i)neN such that u hn converge locally uniformly in time 
to a locally Lipschitz curve u°° in (0, +oo). 

Step 3: characterization of the limit. Let us now fix an arbitrary point v°° £ D(<p°°) and a 
corresponding approximating sequence v £ D(<fi h ) as in ()2.39j) . By (EVI A I of Theorem 12.51 we 
know that 

2^d 2 (^^)-id 2 (u^^)<E A (t- S )(^(^)-^(^)); (2.45) 

We then pass to the limit in (|2.45l) as h t +oo, h £ H, using the facts that u\ converges pointwisc 
to Ut in X and applying (|2.38|) for u\ and (|2.39[) for v ; we obtain 

^^d 2 (ur,0 - id a (u~ O < E A (t - «)(#«°°) - </>«)) (2.46) 

for every v°° £ D{<jf°), < s < t. A further application of Theorem 12.51 shows that u°° solves 
EVI A (X,d,0°°). 

In order to check that lim^o Ut° = u o° we use P-46|l at s = and the lower semicontinuity of 
(f)°° , which yields 

limsupd 2 ^ ,^ 00 ) < d 2 (0°°) for every v°° £ D^ 00 )- (2.47) 
40 



since wg° £ D{4>°°) we conclude that lim^o d(wf 3 , ug°) = 0. 

Since the limit is the unique solution of EVI A (A, d, (f>°°) starting from itg°, we conclude that 
the whole sequence u h converge to u°° . 
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Step 4.: convergence of energy. We argue as in the proof of (|2.17[) and (|2.18[) : for a fixed t > 
and applying (|2.39l) to uf we find a sequence (u^)heN converging to uf 2 with lim^ +00 <j) h {u^) = 
(j>°°(u%°). By estimate (|2.43|) . the slope \dcf> h \(Uf) is uniformly bounded by a constant M t so that 

<f{u\) > <j> h {u h t ) - M t d(u* «*) - \d 2 {u h tl u h t ). 

Passing to the limit as h j" +00 we get limsup^ +00 4» h (ut) < oo (m£ , °), which combined with 
(|2~38]l yields the second identity of (f2~40|) . □ 

3. Convergence of the minimizing movement method and generation results 

We have seen in Theorem l2.15l a first convergence theorem for the relaxed minimizing movement 
method: it basically says that if <f> admits a X-gradient flow according to Definition 12.41 then any 
family of discrete solutions converges to the unique continuous solution of QEVIaI ) as the time step 
converges to 0. 

In this section we revert this point of view and we try to prove the existence of the X-gradient 
flow when <f> is geodesically A-convex by studying the convergence of the (relaxed) minimizing 
movement Method. 

In the following we present three different results in this direction: 

(1) A simpler convergence result when the sublevels of </> are locally compact: in this case we 
avoid any geometric restriction on the distance d of X and we do not need any Cauchy 
estimate. On the other hand, the (not necessarily unique) limit points of the discrete 
solutions are just curves of Maximal Slope, according to Definition 12.101 in general it is 
not possible to prove that they solve ((EVTa). 

(2) A first generation result for A-gradient flows, by assuming that the minimizing movement 
generating functional $(r, U ; V) defined by (|2.31|) satisfies a suitable convexity property 
(which results from the combination of the convexity of d 2 and of <j>). 

(3) A second generation result when d 2 (-,u) is semiconcave along geodesies and the metric 
space satisfies a local angle condition between triple of geodesies emanating from the same 
point. 

Differently from the first approach, the last ones provide explicit Cauchy estimates ensuring the 
convergence of the method and do not require any local compactness of the sublevels of cf>. 

3.1. Convergence of the variational scheme in the locally compact case. Let us first 
consider the case when <fi is geodesically A-convex and its sublevels are locally compact, i.e. 3 o 6 X 
s.t. 

I ir G X : 4>(x) < R and d(x,o) < i?| arc compact in X for every R > 0. (3-1) 
Combining j3j Proposition 2.2.3, Corollary 2.4.11] we get 

Theorem 3.1 (Limits of discrete minimizing movements are curves of maximal slope). // <f> is 

geodesically X-convex and satisfies (|3.1j) then for every r > satisfying r _1 > —A and G D(<p) 
the minimizing movement variational scheme admits at least one solution ([/™)„ 6 n- If moreover 

limU° = u 0l )im<i>(U°) = <i>(uQ), (3.2) 

and U T is a family of discrete solutions, any infinitesimal sequence of time steps t„ ], admits a 
convergent subsequence (still denoted by r n ) and a limit curve u £ AC^ Q c ([0, +00); X) such that 

lim U T (t) = u t , lim <p(TL (i)) = 4>{u t ) for every t>0 (3.3) 

uniformly in each compact interval [0,T]. u is a curve of maximal slope (see Definition \2.10\) , 
satisfying the energy identity 

i / \u'\ 2 (r)dr+^ [ \d<t>\ 2 {u r )dr = <t)(u s )-<t){ut) for all < s < t < +00. (3.4) 
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Corollary 3.2 (Existence of Curves of Maximal Slope). Under the same assumptions of the 
previous theorem, for every uq € D(4>) there exists a curve of maximal slope u £ AC 2 oc ([0, +00); X) 
starting from uq and satisfying (|3.4[) . 



3.2. Generation of A-gradient flows by strong convexity of $. In the case when X is an 
Hilbcrt space and is a l.s.c. A-convex functional, it is well known that the minimizing movement 
variational scheme admits a unique solution and the corresponding discrete solution U T converges 
to the solution of flEVI,\D . Applying this approximation scheme, it is then possible to show the 
existence of the A-gradient flow of <j> according to Definition 12.41 

Similar results for minimizing movements of convex functionals in Banach spaces do not always 
hold: indeed, the characterization of gradient flows through the EVI depends not only on the 
convexity of (f> but also on structural properties of the distance d. 
One fundamental property is the 1-convexity of the function v 1— ^d(v,w) 2 , i.e. 

d 2 (« s , to) <(1 - s)d 2 (u ,«;) + sd 2 (v 1; to) - s(l - s)d 2 (v ,v 1 ) 

for every t>o,t>i, for every [0, 1] 9 s 1— > v s geodesic between vq and v±, (3-5) 

which in Banach spaces is equivalent to the fact that d is induced by a scalar product. 
p.5[) is satisfied by the geodesic distance on Riemannian manifolds of non-positive sectional cur- 
vature and characterizes the Alcksandrov non-positively curved (NPC) length spaces, see e.g. 

Actually, using (|3.5[) and adapting a Crandall-Liggctt argument, Mayer was able to prove 
(|1.26[) and then (|1.27[) also for geodesically convex functionals on NPC spaces. 

A crucial consequence of (|3.5p and the A-convcxity of <j) is that the generating functional 
<&(t, V; U) of the minimizing movement scheme (|2.3I I) 

$(r, V; U) := ^d 2 ([/, V) + </>(U) r > 0, U, V G X, (3.6) 

satisfies the r _1 + A-convexity condition along geodesies, i.e. 

the map U 1— > <&(t, V; U) is geodesically (r _1 + A)-convex for every V € X. (3.7) 

One of the main contributions of [3j Chapter 4] is to show that (|3 . T[) can be relaxed, by assuming 
the (t -1 + A)-convcxity of $(r, V; ■) along more general families of curves in X connecting two 
arbitrary points in D(<f>). 

This improvement has been essential to apply the generation result in Wasscrstcin spaces, which 
do not satisfy (|3.5p except for the 1-dimcnsional case. 

Theorem 3.3 (Convergence of the minimizing movement scheme and generation result [3j). Let us 
assume that the functional $ defined in ()3.6j) satisfies the following property: for every V,Uq,Ui £ 
D((j)) there exists a curve j s : [0, 1] — > X with 70 = Uq and 71 = U\, such that 

U 1 y <&(t, V;U) is ( — + A J -convex on 7 for each < r < (3-8) 



T 



i.e. 



Ht, V; 7,) < (1 - s)$(r, V; U ) + s*(r, V; U x ) - - s)d 2 (U , Ui). (3.9) 

zr 



('ij For every U® = Uq G f(0) a^rf r > 1 + tA > the minimizing movement method 
\2.12\ admits a unique solution ([/")„ 6 n C D(<p) 

(2) The corresponding discrete solutions U T converge to u as t I uniformly on compact 
intervals. 

(3) The limit u is the unique solution of flEVI,\D . In particular, <f> admits a X-gradient flow 
according to Definition \2.4\ thus satisfying all the properties stated in Theorem \2.6l 

(4) There exist universal constants C\ t x such that if uq € D(\d(f>\) the optimal error estimate 
holds: 

d(u(t),U T {t))<C x ,T\d4>\{uo)T for every t e [0,T]. (3.10) 
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The main arguments of the proof of Thcorcm l3.3l in a simplified setting can be found in [62] • Sub- 
optimal convergence estimates, inspired by the Crandall-Ligget approach, have also been obtained 
in a different way in [5] and in |23j . 

3.3. Generation results for geodesically convex functionals in spaces with a semicon- 
cave squared distance. In this subsection we consider a geodesically A-convex functional in a 
complete metric space (X, d) whose squared distance satisfies a semi- concavity condition. 

Definition 3.4 (Semi-concavity of the squared distance function). We say that D(<p) C X is a 
K.-SC (Semi- Concave) space if for every geodesic [0, 1] 3 s i— > v s £ D(<f) and for every w G D{4>) 
we have 

d 2 {v s ,w) > (1 - s)d 2 {v ,w) + sd 2 (ui,u;) - Ks(l - s)d 2 (v , Vl ) for every s G [0, 1]. (3.11) 
Examples of K-SC spaces 

-PC SPACES: X is positively curved (PC) in the Alcksandrov sense if and only if X is K-SC with 
K = 1. 

-Aleksandrov SPACES: if X is an Aleksandrov space whose curvature is bounded from below by 
a negative constant —k and D = diam(A") < +oo, then X is a K-SC space with K = — rr^-m ■ 
This class includes all Riemannian manifolds whose sectional curvature is bounded from below. 
-Product and L 2 -SPACES: if {(X^d,;)}^ is a countable collection of K-SC spaces, then the 
product Y[ X with the usual product distance is a K-SC space. If n is a finite measure on some 

separable measure space f2, then 3£ := L 2 (f2;X) = {/ : O — > X : J n d 2 (/(o>), xq) d/x(w) < 
+oo for some xo g X} endowed with the distance d\(f,g) = J a d 2 (/(o;), g(ui)) dfi(u>) is K-SC 
whenever X is K-SC. 

-Wasserstein space: (.9 > 2{X),W 2 ) is K-SC if and only if X is K-SC (see the next section). 

We will also assume that the (upper) angle between couple of geodesies emanating from the 
same point satisfies a suitable condition. 

Definition 3.5 (Upper angles). Let x 1 , x 2 be two geodesies emanating from the same initial point 
xq := Xq = Xq. Their upper angle < u (x 1 ,a; 2 ) g [0, 7r] is defined by 

, i 2 \ \ • f d 2 (xo,xl)+d 2 (x Q ,x 2 )-d 2 (xl,x 2 ) 
cos(< u (a; ,x )) := liminf — — — ^- 

8,t4.o 2d(xo,xl)d(xo,x 2 ) 

Definition 3.6 (Local angle condition (LAC)). We say that D(<f>) C X satisfies the local angle 
condition (LAC) if for any triple of geodesies x 1 : [0,1] — > D((f>), i = 1,2,3, emanating from 
the same initial point xq the corresponding angles 8 13 := <. u (x l , x :l ) satisfy one of the following 
equivalent conditions: 

1. e 12 + e 23 + e 31 < 2tt. 

2. There exist a Hilbert space H and vectors w l € H such that (w l , w j )h = cos^ 1 - 7 ) for 1 < i, j • < 3. 

3 

3. For any choice o/^ 1 ,^ 2 ,^ 3 > one has that £ cos(6» JJ )f t J > 0. 
Examples of (LAC) spaces 

-A Banach space X satisfies (LAC) if and only if X is a Hilbert space. 

-Riemannian manifolds and Aleksandrov spaces with curvature bounded from below satisfy 

(LAC). In particular if ([3~TT|) holds with K = 1 then X satisfies (LAC). 

-Product and L 2 -SPACES: satisfies (LAC) if and only if each (JQ,d,) does; L 2 (£l;X) 

iGN 

satisfies (LAC) if and only if X satisfies it. 

-Wasserstein space: ^(X) satisfies (LAC) if and only if X does. 

Theorem 3.7 (Generation theorem for geodesically A-convex functionals in K-SC and (LAC) 
spaces). Let (X, d) be a complete metric space and let (f> : (— oo,+oo] be a proper, l.s.c. and 
A- geodesically convex functional. 

(1) For every t, 77 > with 1 + tA > and U® t) = «o€ D((f>) the relaxed minimizing movement 
scheme p.34b .b) admits at least one solution (U™ v ) n <=N- 
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(2) If D (cf>) is a K-SC space and satisfies the (LAC) then the discrete solution U T „ converges 
to u as t 10 uniformly in each compact interval. 

(3) The limit u is the unique solution of QEVIaD . In particular, <f> admits a X-gradient flow 
according to Definition \2.4\ thus satisfying all the properties stated in Theorem \2.b\ 

4. WASSERSTEIN SPACES AND DIFFUSION EQUATIONS 

4.1. The Wasserstein space. Here, just to set the notation, we collect some basic definitions 
and properties of the Wasserstein space which will be used in the sequel. For a more detailed 
overview on this topic we refer to [67l [68] , [2] and [4] . 

Transport maps and couplings. We denote by Xj, for some i£N,a separable and complete metric 
space. 3P{X) is the space of Borel probability measures on X. 

If /i G 3P(X{) and t : Xi — > X 2 is a Borel map, we denote by t#/j, £ 3?{X-z) the push-forward 
of /.t through t, defined by 

t # n(B) := fi(t-\B)) for every B £ S3(X 2 ). (4.1) 

We denote by 77* , for i = l,...,n, the canonical projection operator from a product space 
X\ x ... x X n into Xi, defined by 

7T (x\, X n ) . X.[ . 

Given fi\ £ S?(X\) and /12 £ &*{X.%), the class r(/ii, 1x2) of transport plans or couplings between 
lii and [12 is defined by 

r(Mi,/i 2 ) := {7 6 &{X X x X 2 ) ■ 77^7 = /ii, tt|7 = fi 2 }. 

To each couple of measures n\ £ 3?{X\), [i 2 = t#\i\ £ SPiX?) linked by a Borel map t : X\ — > 
X2 we can associate the coupling 

fi := (ix 1 x t)#^i £ £P{X\ x X 2 ), ix x being the identity map on X\. (4-2) 

If fi is representable as in (|4.2|) we say that fi is induced by t and t is a transport map between /ii 
and fi2- Each coupling fi £ Y {^1,112) concentrated on a fi- measurable graph in X\ x X2 admits 
the representation ()4.2j) for some fii -measurable map i, which therefore transports \i\ into \i2- 

Wasserstein distance. Given a complete and separable metric space (X, d) we denote by ZP 2 {X) 
the space of Borel probability measures with finite quadratic moment: fi £ ZP{X) belongs to 
&i(X) iff 

J d 2 (.T,.T G ) dfj,(x) < +00 for some (and thus any) point x Q £ X. (4-3) 
For every couple of measures \i,v £ 3 g 2(X) we consider the Kantorovich problem for the cost d 

W|(£t, v) := min'l / d 2 (.x, y) dr/(x, y) : 7 G T(fi, v) \ . (4.4) 
Uxxx J 

It is not difficult to check, by the direct method of calculus of variations, that the minimum 
problem (|4.4I) admits at least a solution. The subset of v) given by the optimal transport 
plans for (|4.4I) will be denoted by r opt (^, v). Notice that if there exists 7 = x S T(/i, v), 
we have 

/ d 2 {x,y)dj(x,y) = d 2 (x, t(x)) d/^(x). 

JXxX JX 

The quantity W 2 ((J>,i / ) defined by (|4.4|) is a distance between the measures fi, v £ ,<3?2{X) which 
enjoys remarkable properties. 
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Theorem 4.1. Let (X, d) be a complete and separable metric space. Then, W 2 defines a distance 
on & 2 (X) and (& 2 (X), W 2 ) is a complete and separable metric space. Moreover, for a given 
sequence {pik\ken C & 2 {X) we have 

/d/x fc -> / fdfi for every f e C%(X) 
lim W 2 (fi k ,fi) = o { " X , Jx 



lim / d (x,Xo)dfjbk(x) = uniformly w.r.t. k £ N. 

«t+oo Jx\B R (x ) 

The metric space (^ 2 2(^), W2) is called the (£ 2 -) Wasserstein space on X. When X = R d we 
denote by (ffi d ) the subset of ^ 2 (R d ) defined by 

^ 2 Q (R d ) := {A* e ^ 2 (K d ) : A* < =^ d }- (4-5) 

Here we recall the following basic result on the existence and uniqueness of optimal transport 
plans induced by maps (which are then called optimal transport maps) in the case in which the 
initial measure pi belongs to 0>%(R d ). 

Theorem 4.2 (Existence and uniqueness of optimal transport maps, [38l Q2]). For any [i £ 

^2 and v £ &%(M. ), Kantorovich's optimal transport problem (|4.4|) has a unique solution 
7, which is concentrated on the graph of a transport map t. t is the unique minimizer of Monge's 
optimal transport problem on K d for the Euclidean distance 

\x — r(x)\ 2 dfj,(x) : r#/i = v 

The map t is cyclically monotone and there exists a convex open set ncR d with /j,(R d \ fi) = 
and a convex function (j) : f2 — > R such that t(x) = V(j)(x) for fi-a.e. x £ Q. 

Geodesies and curvature properties of ( (l^ d ) , W 2 )- 

Theorem 4.3 (Geodesies in the Wasserstein space). Given fi,v £ 3 g 2 (R d ) and 7 £ r opt (fi, v), 
the curve 

[0, 1] 3 S H> fl s = ((1 - S)^ 1 + S7T 2 ) # 7. 

is a constant speed geodesic between \x and v, i.e. it satisfies 

W 2 (ns,Ht) = \s - t\W 2 ([i , Hi) for every s,t £ [0, 1]. 

Vice versa, any constant speed geodesic between \x and v can be built in this way. 
If j = (i x t)#fi, then 

M s = ((l-s)z + st) #/ u, se[0, 1]. 

In particular, (&> 2 Q& d ), W 2 ) is a geodesic space. 

In view of the application to the Wasserstein framework of the theory of gradient flows in metric 
spaces developed in the previous section, we recall the following theorem (see Theorem 7.3.2 and 
Example 7.3.3 of [3]) 

Theorem 4.4 ((^* 2 (R d ), W 2 ) is a PC-space). For any /Uo> Mi> A*2 € & 2 (W l ) we have 

Wf Gu s ,/i 2 ) > (1 - s)Wi(fi ,^ 2 ) + sWUpuim) - s(l - s)W|(A«o,A*i) for every s £ [0, 1], (4.6) 
where /U s is any constant speed geodesic between /j,q and \i\ . 

Moreover, when d > 2 f/iere is no constant X £ R suc/i i/iat W 2 (-, pb 2 ) is X-convex along geodesies. 



According to Alcksandrov's notion of curvature for metric spaces, (|4.6j) can be interpreted by 
saying that the Wasserstein space is a positively curved metric space (or PC-space). 
Then, the square of the Wasserstein distance along geodesies does not satisfy the 1-convexity 
assumption (|3.5[) . which would be the most natural to prove the generation Theorem 13.31 for the 
gradient flows of A-convex functionals. 

However, the theory developed in the previous section allows for a great flexibility in the choice 
of the connecting curves. In particular, for the Wasserstein space on M d the 1-convexity property 
p.5j) is satisfied along the following class of curves: 
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Definition 4.5 (Generalized geodesies). A generalized geodesic joining // 2 to /Z3 (with base point 
fii) is a curve of the type 

[0, l]3m , = ((1 _ s)7r 2 + sn 3 h ^ (4 7) 

where 

/J, G r(/ii,/i 2 ,A*3) tt 1,2 #M G r op t(/ii,/i 2 ), 7r 1,3 #At e r opt (^i,Ai3)- (4.8) 



i/ere r(^i, /i 2 , /J3) := {7 £ ^(R d x R d x M d ) : 7^7 = i = 1,2,3} and ir™ : M. d x 
R d x R d «s i/ie projection on the i-th and j-th coordinate. 



pd 



Proposition 4.6 (1-convexity of the Wasserstein distance along generalized geodesies). Let pt\ , \ii , /X3 € 

^2(K d ) and ?e£ /x € r(/ii, // 2 , ^3) suc/i £/ia£ tt^A* € ropt(£*i, /Xj), /or i = 2,3. Then, 

W 2 2 (/i^ 3 ,/ii) < (1 - s)Wf Gui,M2) + sW 2 2 ( Ml ,M3) - «(1 - s)W 2 2 (m 2j /x 3 ) /or e«en/ s e [0, 1]. 
In particular, the function hW^ (fJ*i, ■) is 1-convex along generalized geodesies. 

4.2. Absolutely continuous curves in (^ 2 (R d ), W2). We recall here some basic properties 
of absolutely continuous curves in the Wasserstein space, which are related to the "dynamic 
interpretation" by Benamou-Brenier [9]. The main result is the following [3) Theorem 8.3.1]: 

Theorem 4.7 (Absolutely continuous curves and the continuity equation). Let pit ■ (0, +00) — > 
^ 2 (K d ) be an absolutely continuous curve and let \/j,'\ € L 1 (0, +00) be its metric derivative. Then 
there exists a Borel vector field v : (x,t) v t {x) such that 

v t e L 2 ([i t ;R d ), \\v t \\ L2(lltm < for ££ x -a.e. te (0,+co) (4.9) 

and the continuity equation 

+ V • (v t fxt) = mR d x(0,+w) (4.10) 

at 



holds in the sense of distributions. 
Moreover, 



-L 2 (n t ;m d ) 



v t e {V^: v?S C* c °°(R d )} /or -a. e. i e (0, +00). (4.11) 

Conversely, if a curve (0, +00) 5 t 1— > fj,t € ^* 2 (R ) is continuous w.r.t. the weak topology on 
&(M. d ) and it satisfies the continuity equation (|4.10p for some Borel vector field v t with 

1 1 1>t Wwfot-Bd) < +OO, 



then lit is an absolutely continuous curve and |/x'|(£) < |z, 2 (/^ t ;M d ) for J?? 1 -a. e. t £ (0,+oo). 

Then, the minimal norm for the vector fields v t satisfying (|4. 10[) for an absolutely continuous 
curve [it is given by its metric derivative. Furthermore, such "minimal" vector fields satisfy (|4.1ip . 
This fact suggests the following definition (we refer to O Chap. 8.4]). 

Definition 4.8 (Tangent space). Let fi € ^ 2 (R d ). We define the tangent space to ^ 2 (M d ) at 
the point fi as 

Tan M ^ 2 (R rf ) := {V<^ : <p i C~(R d )} L ~ (Mt:R '' ) . (4.12) 

Proposition 4.9 (Tangent vectors to absolutely continuous curves). Let n t : (0,+oo) -> ^ 2 (R d ) 
be an absolutely continuous curve and let v t G L 2 (/j rt -M. ) be a Borel vector field such that (|4.10l) 
holds. Then v t satisfies (|4.9|) if and only if v t € Tan Mf ,9 > 2 {^ d ) for ££ x -a.e. t e (0, +00). The 
vector v t is uniquely determined J^ 1 - a. e. by (|4.9I) and (|4.10[) . 

Tangent vector fields are also strictly related to the first order infinitesimal behavior of the 
Wasserstein distance along absolutely continuous curves. 



20 



SARA DANERI AND GIUSEPPE SAVARE 



Proposition 4.10. Let fx t : (0, +00) — > ^jO^ ) be an absolutely continuous curve and let v t G 
Tan Mt &2( 1 ^ d ) be the tangent vector characterized by Proposition \4-9\ Then, for ^f 1 -a.e. t G 
(0, +00) the following property holds: 

lim W 2 { l H +h ,(i + hv t )^) = Q 

h^O \h\ 

Then, if fi t and ^t+h are linked by an optimal transport map t^' t +h , we have 

lim t ^ t+ h - 1 = v t m L 2 (fi t ;R d ). 

As an application of (|4. 13[) we are able to show the J2f 1 -a.e. differentiability of 1 1-> ^(//^cr) 
along absolutely continuous curves \it in terms of tangent vectors and optimal transport plans; 
this provides a useful formula for the left hand side of the flEVI,\| ) 

\-M W %(l J 't,p) < </>(<*) - 0(A*t) for every a G D(4>) 

Theorem 4.11. Let [it ■ (0, +00) — > S 2(^- d ) be an absolutely continuous curve, letvt G Tan^ t ^2(R d ) 
be the tangent vector characterized by Proposition ^ -9\ and let a G tP 2 {X). Then 

Id f 

17^(^,0-)= (x - y,V t (x)} d-y(x,y) for every 7 G T opt {fJ,t, cr), (4.14) 

for Jz^-a.e t G (0, +00). In particular, if fj, t G ^(R d ), 



1 d 



A -W 2 ^ t ,a)= (x-t^(x),v t {x))d^(x), (4.15) 
z at j^d 

where t a is the unique optimal transport map between fi t and a. 

4.3. Geodesically A-convex functionals in .^(R'O- hi this section we introduce the three 
main classes of A-geodesically convex functionals on the Wasserstein space (.^(R'O, W2) intro- 
duced by McCann [47] (for the proofs of the main results we refer to Chapter 9 of [3]). 

Example 4.12 (Potential energy). Let V : M. d R be a Ay-convex function for some Ay € K 
and let us define the potential energy 



r(/x) := / V(x)d(j,(x) for every fx G 0* 2 (R )• (4.16) 
For every fii, fi 2 G D(Y) and fi G T(fii, (12) we have 

^([(l-sy + STr 2 ] /x) < (l- s )r( M x) + S r(/i 2 )-^s(l- S ) / Ix-ypd/x^y). (4.17) 
In particular, "V is geodesically Ay-convex on &2(^ d )- 

Example 4.13 (Interaction energy). Let Xw < and let W : R d — > R be a Aiy-convex function 
with W(—x) — W(x) for every x G R d , and let us set 

:=- [[ W(x ~ y) dfi(x) d/i(y) for every fi G ^ 2 (R d )- (4.18) 



2 _ 

For every fii,fi2 G D(W) and /i. G r(/ii,/i2) we have 

W ( [(1 - .sV 1 + stt 2 ] M ) < (1 - s)>T(/ii) + S #> 2 ) -^s(l-s) [ \x-y\ 2 d»(x, y). (4.19) 
In particular, is geodesically Aiy-convex on ^ 2 2(R d )- 

Example 4.14 (Internal energy). Let U : [0, +00) — > R be a convex function such that 

U(0) = 0, liminf Hill > —00 for some a > — — , lim = +00 (4.20) 

w sio s a d + 2 s^+oo s 

the map s t-> s d U(s~ d ) is convex and non-increasing on (0, +00). (4-21) 
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The internal energy functional 

*&.)-( L^M^M "K^-'-iS* (4.22) 
+00 otherwise. 

is geodesically convex and lower semicontinuous in £?2(R d )- Among the functionals °i/ with U 
satisfying (j4~20l) and (|4~2T|) we have 

the entropy functional U(s) = s log s (4.23) 

„m 

the power functional U(s) = -, m > 1. (4.24) 

m — 1 

Property (|4.21[) is also satisfied in the range 1 — ^ < m < 1; however, since in this case U is 
not superlinear at infinity (the third condition of (|4.20[) ), we have to consider the relaxed lower 
semicontinuous functional 

&*(»)=[ -^—d^f d , if u = p& d + u ± , J_J? d . 
J Rd m-l 

Example 4.15 (Relative entropy). Let fi, 7 be two measures in ^(R^). Then, the relative entropy 
of /1 w.r.t. 7 is the functional defined by 

!f d/j, dfx 
i^ l0g ^ d7 ' lf ^ <<7 ' (4.25) 
+00 otherwise. 

We note that (|4.25[) corresponds to the internal energy associated to the function (|4.23l) when 
7 = Jzf d (which nevertheless is not in ^(R d )). 

Proposition 4.16. The relative entropy Ent 7 is geodesically convex in 0> 2 (R d ) if and only if one 
of the following conditions holds: 

1. 7 = e~ v Jz? d for some convex function V : R d -> R; (4.26) 

2. 7 is log-concave, i.e. for every couple of open sets A, B C R d , t € [0, 1] 

log 7 ((l -t)A + tB) > (1 -t) log y(A) +t log 1 {B). (4.27) 

Now we introduce the notion of convexity which will be crucial to apply the metric theory of 
gradient flows developed in Section [3] to the main examples 14.121 14.131 14.141 14.151 of geodesically 
A-convex functionals in the Wasserstein space. 

Definition 4.17 (Convexity along generalized geodesies). Given A € R, we say that <j> : ^(R^) — > 
(—00, +00] is X-convex along generalized geodesies if for any ^i,^t2,M3 ^ D{4>) there exists a 
generalized geodesic [0, 1] 3 s h> A 4 ^ 3 joining /i2 to [13 induced by a plan /j, satisfying (|4.8[) such 
that 

<f>(f4~^ 3 ) < (1 - s)<f>{n2j + s<f)(n 3 ) - ^s(l - s) / |x 2 - Z3| 2 d/A(xi,x 2 ,x 3 ) /or every s e [0, 1]. 

(4.28) 

Lemma 4.18. [(1/r + X) -convexity of <3?(t, /ii; •)] Let : <^2(R d ) — > (—00, +00] &e a proper 
functional which is X-convex along generalized geodesies for some A 6 R. Then, for each \i\ € D(<t>) 
and for each < r < the functional 

5?(t, fii; fx) := — W^ifJ*!, (J>) + <fi(n) satisfies the convexity assumption (|3 . 8[) . 
2r 

By Lemma l4.18[ whenever <j> is proper, l.s.c. and A-convex along generalized geodesies in tP^^R ) 
we can apply Theorem 13.31 and get the existence, uniqueness and regularizing estimates for the 
solutions of the EVIa • 

The examples of geodesically convex functionals in £?2(R d ) which have been introduced in this 
section are also convex along the generalized geodesies. 
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Theorem 4.19. The junctionals on 3 g 2(^ d ) considered by the examples \4-12\ \^T3\ (with A < 0), 
\4-H\ (under condition (|4.21[) ), and \4.lS\ (under condition (|4.27|) ) are X-convex along generalized 
geodesies. 

4.4. Gradient flows in (.^(B^), W 2 ) and evolutionary PDE's. In this section we show some 
applications of the generation result [3~3l to the existence, well-poscdncss, and asymptotic behavior 
of nonnegativc solutions p : R d x (0, +00) — > K of evolutionary PDE's of the type 

J^-p-V- (pV^ ) =0 in R d x (0,+oo), (4.29) 



dV V Sp 

where S is the first variation of a suitable integral functional; here we consider the case of 
functionals which are a positive linear combination of the three kinds of contributions considered 
in the Examples |4"7T2"1 14351 and l4"7Hl i.e. <j>(p) := ay<%{p) + a 2 f(p) + a 3 W(p) where q, > and 



W{p) := / U{p{x))dx, 

JR d 

V(p) := I V{x)p{x)dx, (4.30) 



W{p) ■=- [ W{x-y)p{x)p{y)dxdy, 

so that 

=aiU'(p) + a 2 V + a 3 W* p. (4.31) 

dp 

In the particular cases of the Fokker-Planck equation ((f) = + Y and U(r) = rlogr) 

d 

— p- V ■ (Vp + pW) = in R d x (0,+oo), (4.32) 
and of the nonlinear diffusion equations (4> = °i/ , U(r) = ^zr[r m ) 

^p-Ap" l -0, m>l-i (4.33) 

the Wasserstein approach has been introduced by the remarkable papers of Jordan-Kinderlehrer- 
Otto [36j and Otto [56] and then extended in many interesting directions, covering a wide range 

of applications: see e.g. El Q] Q21 EH EHJ 133 EH HI HOI H ED 131 El HI 113 SH] ■ 

The results presented here are just examples of the transport approach. 

Theorem 4.20. Let V, W, U be as in the examples \4~l^\4A^ and \4~AJ\ let V, W, a l/ be defined as 
in (|4.30j) . and let (f> := a\ ^ + ol 2 V + ot 3 W . For every pq G 0^ 2 {W l ) there exists a unique solution 
p t G Lip loc (0, +00; 0> 2 (R d )) satisfying EVI A (^ 2 (K d ), W 2 ,<f>), A := a 2 X v + a 3 X w , 

l^WKpuO-) <<Ko)-<I>(jh)-^W2Qh,o) for every a G D(</>) (4.34) 

with lim p t = Po in ^ 2 (K d ); the curve p satisfies all the properties stated in Theorem \ 2.6l the 
continuity equation 

d 

— p t + V • (p t v t ) = ffll d x(0,+oo), with (4.35) 

v t G Taiv t ^ 2 (K d ) JSf 1 -a. e. m (0, +00) and t h-> / |u t | 2 dp t = |p;| 2 € L^ c (0, +00), (4.36) 

and for J£ l -a.e. t G (0, +00) i/ie velocity vector v t £ Tan Mt ^2(K rf ) satisfies the "subdifferential 
inequality'' 

I (v t {x),x-y) + ^-\y-x\ 2 d'y t (x 1 y)<<j){o-)~<j)(p t ) for every j t e T opt (p t ,o-). (4.37) 
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We can give an explicit characterization of the system (|4.35[) . (|4.37|) . Here we consider the 
simpler case when U, V, W are diffcrcntiable and satisfy a doubling condition: for a function 
/ : M. h — > R it means that there exists a constant C > such that 

f(x + y)<C(l + f(x) + f(y)) for every x,y€R h . (4.38) 

We also set 

Lu{r) := rU'{r) - U(r) if r > 0, L v (0) = 0. (4.39) 

Theorem 4.21. Under the same assumptions of the previous theorem, let us also suppose that 
U, V, W are differentiate and satisfy the doubling condition (|4.38|) . The locally Lipschitz curve 
p characterized by (|4.34l) (or by (|4.35[) . (|4.37|) ) solves the following evolutionary PDE's in R d x 
(0,+oo) 

Transport equation, <fi = y , v t = — W: 

|-/it - V ■ (/i t W) = 0. (4.40) 

Ot 

Nonlocal interaction equation, <f> = W, v t = — (VW) * pt 

^-p t -W-(p t (WW*p t )) = 0. (4.41) 
Of 

Fokker-Planck equation, <j> = U(r) = rlogr, —ptVt = V/M + Pt^V 

■f- p t - V • (V/xt + /itVF) = 0. (4.42) 

In £/iis case, ^ = p t ^ d with p t G W^' c 1 (M d ) /or J^-a.e. t G (0, +oo). 

Nonlinear diffusion equation, <f) = , p t v t = —S7L(j(p t ) where pt = Pt-^ d <C Jz? d , 

jIH-A(L u (jh))=0, (4.43) 

with Lu(pt) G W£,!(M d ) /or jSf^a.e. i G (0, +oo). 

Drift- diffusion with non local interactions, <fr = & + "f + , —ptVt = VLu(pt) + (J-tVV + 
Pt{{VW) * p t ), p t = Pt ^ d « ^ d , 

^p t - V • {VLu(pt) + p t VV + ih((VW) * p t )) = (4.44) 

Ot 

We refer to [3l Chap. 11] for the proofs and for more general and detailed results; here we just 
give a sketch of the argument showing that (|4.35l) . (I4.37[) yield (|4.42p when <f> = & + V in the 
case of U(r) = rlogr. 

Let us fix a time t > where (|4.37[) holds, a smooth test function £ € C^°(M. d ), and t e := i+eWC- 
If |e|max R d ||D 2 £j| < 1 the coupling 7 £ := (i,t e )^p t is optimal between p t and (t e )#p t so that 
(1437) yields 

-e / (« t (a?),VC(x))d^t(a?)<^((t e ) # /it)-^(A(t). 

Setting 

d/it e _ d(i £ ) # /i t 



dJz? d ' ^ ' dJz? rf 



we get 

— £ 



f (v t ,VQdp t < [ p\\ogp\A^ d -\ p t \ogp t dJ? d + [ (V(t £ (x))-V(x))dp t (x) 
Jw JR d Jm d Jw d 



Applying the change of variables formula 

p e t (t e (x))det[i + £D 2 ((x)] = Pt (x), 

we obtain 

-£ [ (v t ,V0dpt < - f p(a;)log(det[i+eD 2 C(x)])d^f d + f (V(t E (x))-V(x)) dp t (x) (4.45) 
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Finally, dividing by e and taking the limit of (|4.45|) as e tends to we get 

-/ (v t ,VOdlH= [ ( - AC(s) + VV(x) ■ VC) dfi t for every ( G C^°(M. d ), 
so that \i satisfies the distributional formulation of (|4.42j) . 

4.5. The heat flow on Riemannian manifolds and metric-measure spaces. We conclude 
these notes by giving a short account of possible applications of the Wasserstein setting to the 
generation of the heat flow in Riemannian manifolds and metric-measure spaces. 

Let us start with a compact and smooth Riemannian manifold (M, g)\ we denote by d g its 
Riemannian distance and by 7 = Vol s G &(M) its (normalized) volume measure. 

In {P 2 (M) we consider the Relative Entropy functional Ent 7 as in (|4.25[) . Von Renesse-Sturm 
j66] proved 

Theorem 4.22. The Relative Entropy functional Ent 7 is geodesically X-convex in ^^(jVf) if and 
only if M satisfies the lower Ricci curvature bound 

Ric(M) > A i.e. Ric x (v, v) > X\v\ 2 g for all x G M and v G Tan x (M). (4.46) 

In this case, it is possible to show (see [55j Ell [27l HH EH1 E2]) that the Relative Entropy 
functional Ent 7 generates a A-gradient flow S t ■ &%(M} — > &<x(M) according to definition 12.41 
which coincides with the classical heat flow on M. 

Theorem 4.23. The relative entropy functional Ent 7 generates a X- gradient flow St in S^i^M) 
according to Definition \2.4\ (and thus satisfying all the properties stated in Theorems \2. 6\ and \2.15\) . 
A curve \i t G @ > i(M') is a solution of EVIa(M, d g , Ent 7 ) if and only if its density p t = d/x t /d7 
solves the Heat equation 

d 

— Pt ~A gPt = inMx(0,+oo), 

where A g is the Laplace- Beltrami operator on M . 

The adimensionality of the form of the Entropy functional f|4. 25[) and the purely metric character 
of the EVI suggest that one can use them to define a heat flow on more general measure-metric 
spaces (X, d, 7), where (X, d) is a complete and separable metric space and 7 G &(X). Indeed, as 
it has been often pointed out in the previous sections, the EVI formulation gives nice regularity, 
stability and asymptotic properties for the related flow. We briefly sketch two possible approaches: 

Approximation by measured Gromov-Hausdorff convergence. We consider a sequence of smooth 
and compact Riemannian manifolds (M h , d h , Vol' 1 ) converging to a limit measure-metric space 
(X, d,7) in the measured Gromov-Hausdorff convergence: it means [51] that a sequence {d' l }/c 6 N 
of (complete, separable) coupling semidistances on the disjoint union M h U X exists such that the 
restriction of d h on M h (resp. X) coincides with d h (rcsp. d) and 

lim W£(Vol h , 7) = 0, W2 is the Wasserstein distance on &> 2 (M h U X) induced by d' 1 (4.47) 

A sequence pf 1 G 3?i{M h ) converges to fj, G &i{X) if lim^+oo W£ (fji h , fi) = 0. Adapting the 
arguments of Theorem 12. 171 it is possible to prove the following asymptotic result: 

Theorem 4.24 ([63]). Let us assume that the compact Riemannian manifolds M h satisfy the 
uniform lower bound on the Ricci curvature Kic(M h ) > A for some X G M independent of k 
and converge to (X,d,j) in the measured Gromov-Hausdorff sense. Then the Relative Entropy 
functional Ent 7 admits a X-gradient flow St on ^2 (X) and for every sequence of initial measures 
fig G 3 g 2{M h ) converging to no G £^2 {X) the corresponding solution /xj 1 of the Heat flow on M h 
converges to St(no) in £?2 (X) for every t > 0. 

Applying Theorem l2.9l one finds in particular that the limit Entropy functional Ent 7 is strongly 
geodesically A-convex (at least when the support of 7 is X), a stability result that has been proved 
by [651 S3- 
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Intrinsic costruction. Starting from Theorem 14.221 Sturm [55] and Lott-Villani [12] intro- 
duced the concept of metric- measure spaces {X, d, 7) satisfying a lower Ricci curvature bound, by 
requiring that the relative entropy functional Ent 7 is gcodesically A-convex in &2 (-^0- 

Definition 4.25 (Lower Ricci curvature bounds for metric-measure spaces). We say that a metric- 
measure space (X, d,7) has Ricci curvature bounded from below by a certain Agl (and we write 
Hic(X) > X) if the relative entropy Ent 7 is X-geodesically convex on X . 

It is then natural to look for other intrinsic properties of X which are sufficient to deduce the 
existence of the associated EVI semigroup. It is interesting to notice that if the relative entropy 
functional generates a A-gradient flow St then St is a semigroup of linear operators [63j . In the case 
of compact positively curved (PC) Alexandrov spaces the existence of a A-contracting gradient 
flow can be deduced by a general unpublished result of [51] and has been recently proved by Ohta 

In more general cases, we can apply Theorem 13.71 

Theorem 4.26. Let us suppose that (X, d,7) is a complete and separable metric-measure space 
with Ricci curvature bounded from below, according to Definition \4-25\ and measure 7 with full 
support supp(7) = X. If X satisfies the Local Angle Condition \3.6\ and it is K-semiconcave as in 
\3.4\ then the relative entropy functional Ent 7 generates a X-gradient flow on &2{X) which can 
be uniquely extended to a Markov semigroup (i.e. linear, order preserving, strongly continuous, 
contractive) in every space L p ("f), p g [1, +00). 
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